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5 Field o£ the invention 

. The present invention relates to techniques for 
evaluating download performance of web pages, such as 
times involved in downloading web pages. 

The invention was developed by paying specific 
10 attention to the possible application to mobile 
telecommunications networks such as GPRS {General 
Packet Radio Service) and UMTS (Universal Mobile 
Telecommunications System) networks. Reference to this 
preferred field of application is not to be construed 
15 as intended to limit the scope of applicability of the 

invention. 

■ Description of the related art 

Download times of web pages in networks such as 
GPRS and UMTS networks are affected by a number of 
20 factors. In addition to network performance, the 
characteristics of each web page, such as the number 
and the dimensions of the objects comprised on the 
page, and the browser type used for downloading are 
other factors that come into play in determining 
25 download performance of web pages. 

A number of techniques are already available with 
Internet service providers (ISPs) or content providers 
(CPs) in order to verify in a non-intrusive way 
(essentially by way of simulation) the expected 
30 performance of services offered to clients. 

For instance, the NetPorecast Report 5055 entitled 
"Understanding Web Performance" by T. Sevcik and J. 
Bartlett, which is an expanded version of an article 
with the same title first published in Business 
35 communications Review. October 2001, includes a review 
of commercial products adapted for determining download 
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times of certain web pages, A definition is also 
provided of certain parameters that can be adapted to 
those commercial tools in such a way to permit 
simulation of download times. 
5 A basic disadvantage of such prior art techniques 

lies in that they not take into account the role played 
by certain variables such as: 

- the type of browser used for downloading, 

- the types of web pages considered, 

10 - the specific performance level of certain 

network services being used, for instance in terms of 

available bit rate and/or latency. 

In fact, these variables are essential in 

determining the response time of a network such as a 
15 GPRS or UMTS network to the request for a certain page 

to be downloaded, indicated throughout this document as 

web pages. 

Furthermore, such prior art techniques fail to 
take into account the relationship existing between 
20 notional channel capacity available in terms of bit/s 
and the payload capacity actually available. 
Object and summary of invention 

The object of the present invention is thus to 
provide a technique for predicting ctownload times that 

25 may lead to accurate results and that also lends itself 
to be adapted to the specific characteristics of the 
services provided by a determined service and/or 
contents provider. 

According to the present invention, such an object 

30 is achieved by means of a method having the features 
set forth in the claims that follow. The invention also 
relates to a corresponding system as well as to a 
coinputer program product directly loadable in the 
memory of a cotrputer and including software code 

35 portions for performing the method of the invention 
when the product is run on a computer. 
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A preferred embodiment of the invention evaluates 
download performance parameters of web pages accessible 
via a network by providing at least one model for 
predicting a set of download performance parameters for 
5 said web pages as a fxinction of a respective set of 
input parameters. The at least one model includes at 
least one optimisation parameter (X) . 

Such a model may typically comprise a module for 
evaluating the sum of: 
10 - at least one first factor determined 

analytically on the basis of network and web page 
parameters , and 

- a second factor being a function, preferably of 
the hyperbolic type, of an optimisation parameter. 
15 A set of sample web pages is defined and said set 

of download performance parameters for the sample web 
pages are both measured and evaluated on the basis of 
the model for different values of the at least one 
optimisation parameter. An error indicative of the 
20 difference between the download performance parameters 
for the sample web pages as measured and as evaluated 
on the basis of said model, respectively, is defined 
and an optimised model is selected including a value of 
the least one optimisation parameter minimising the 
25 error or. reducing it below a predetermined value. 
Download performance parameters for any selected set of 
pages accessible through the network (N) can then be 
evaluated without interfering with operation of the 
network on the basis of the optimised model. This is 
30 done (in a non-intrusive manner, i.e. without 
interfering with operation of the network) by way of 
prediction on the basis of the selected model. 

Preferably, the set of download performance 
parameters . includes at least one parameter selected 
35 from the group consisting of download time for a given 
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web page and an efficiency index indicative of how said 
given web page exploits the capacity of the network. 

Still preferably, the prediction model is based on 
at least one parameter selected out of the group 
5 consisting of the throughput of the network, the round 
trip time (RTT> of the network, and at least one of the 
type and dimension of each object included in the web 
pages considered. 

In a particularly preferred embodiment of the 
10 invention, the model corresponds to the relationship: 

where t is the total download time of the page, n 
is the number of objects therein, d is the average size 
of these objects, b is the throughput of the downstream 

15 link (downlink) , h is the dimension of the HTTP 
headers, 1 is the network RTT and X is a free parameter 
to be optimised, namely the parameter whole value 
identifies the ^'optimum" model used for evaluating 
download performance prediction within a plurality of 

20 available models corresponding to the general 
relationship reproduced above. 

Specifically, with the technique described herein, 
the response times to be expected ^'during downloading 
can be accurately simulated for each service provider 

25 or contents provider without interfering with operation 
of the network. Additionally, an efficiency index can 
be defined representative of the amount each web page 
effectively exploits the capacity of the respective 
network . 

30 The solution described herein gives rise to an 

architecture cuid an arrangement that permit both the 
download times and the efficiency index related to a 
certain web page to be predicted starting exclusively 
from the number and dimensions of the objects comprised 

35 on the web page in question. 
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The main advantage of such an architecture lies in 
that it permits the download times and the efficiency 
index to be evaluated (i.e. estimated) for a large 
number of pages based on an optimised model identified 
5 via* measurements carried out on a relatively small set 
of sample pages . 

An extensive database can thus be rapidly created 
which is adapted for generating statistics related to 
the typical surfing speed as perceived by the user of a 
10 network such as GPRS/UMTS networks. 

In the presently preferred embodiment , the 
architecture in question includes essentially two 
categories or groups of elements, namely: 

- those elements adapted for carrying out '^in the 
15 field" measurements on the network (GPRS and/or UMTS 

for instance) with reference to a set of sample pages, 
thereby permitting identification of a corresponding 
optimum model , and 

- those elements that exploit the model thus 
20 identified for evaluating purposes i.e. for generating 

predictions . 

Brief description of the enclosed drawings. 
The invention will now be described, by way of 
example only, by referring to the enclosed figures of 
25 drawing, wherein 

- figure 1 is a block diagram of architecture for 
determining model parameters related to downloading web 
pages in a network such as a GPRS or XJMTS, 

- figure 2 is a block diagram of architecture for 
30 predicting download times, 

- figure 3 is a flow-chart representing in- the - 
field measurements and calculation of model parameters, 
and 

- figure 4 is flow-chart representing the process 
35 of estimating download parameters. 
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Detailed description of a preferred embodiment of 

the invention. 

In the diagram of figure 1, reference I generally 
denotes a wide area network such as the Internet, while 
5 reference N represents a network, such as a mobile 
telecommunications network, adapted for providing 
access to the network I. Exemplary of the network N 
are, for instance, a GPRS or a UMTS network. 

Reference 10 denotes a mobile terminal such as a 
10 mobile GPRS /UMTS terminal used primarily as means for 
conveying data (that is essentially as a modem) . 

Reference 12 is a processing unit such as a 
computer configured for in the field measurements. The 
processing unit 12 is typically a personal computer 
15 (PC) such as a '^laptop'' portable computer adapted to be 
connected to the mobile terminal 10 to access the 
Internet I via t.he network N. 

The unit 12 is configured (in a manner known per 
se) in order to perform a set of measurements 
20 including: 

- throughput measurement, 

- RTT (round trip time) , 

- download times of selected web pages- 
Reference 14 denotes a server terminal facility 

25 comprised by one or more servers adapted to be accessed 
via the Internet and containing reference files used 
for carrying out measurements of the network 
parameters. Those files may be comprised e.g. of HTML 
pages with given formats and sizes. 

30 Connection of the reference server (s) to the 

network N and to the databases associated therewith (to 
be described in greater detail in the following) takes 
place via respective routers designated Rl and R2 . 

The throughput measurement tool provided in the 

35 computer 12 is adapted to measure throughput by 
downloading corresponding files from the reference 
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server: typically, this may be a HTTP client 
downloading a single HTML file. The results are stored 
by writing them as database items in a measurement 
dateibase 16 . 

5 Similarly, the RTT measurement tool installed in 

the computer 12 carries out RTT measurements towards 
the reference server (s) 14. Typically, RTT is measured 
by using a method similar to the method used by the 
PING command in operating systems (OS) . The results are 

10 again stored as items in the database 16, 

Reference 18 denotes another database including 
items comprising a list of sample web pages. This is 
essentially a database including a list of a relative 
small set of web pages intended to be used for 

15 selecting an optimised model to be sxobsequently used 
for evaluation (i.e. estimation or prediction) purposes 
with reference to a generally much broader set of web 
pages . 

The set of sanple pages is chosen in such a way 
20 that the sample pages represent in a statistically 
meaningful manner the types of pages for which download 
performance is to be predicted. For instance, the 
sample pages in question can be selected as the 
homepages of 100 most frequently accessed web sites in 
25 a certain area. 

As already indicated, the measurement database 16 
includes the results of those measurements carried out 
on the network N (essentially throughput and RTT) for 
each web page in the set of sample pages. 
30 For each sample page subject to measurement^ the 

following items are usually collected and stored: 

- the page URL, 

- the size of each object therein, 

- the start time and the end time of downloading 
35 each object, 

- the total download time. 
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- the throughput and the RTT of the network at the 
terminal during the time interval where the measurement 
was carried out (the time interval is. chosen 
judiciously in such a way that no appreciable 

5 variations take place in the network parameters while 
measurements are being carried out) , 

- type and version of the browser used. 

After being populated, the database 16 is used for 
calibrating the free parameter (s) in the evaluation 
10 (i.e. estimation or prediction) model. 

As indicated, such a model may typically comprise 
the sum of; 

at least one first factor determined 
analytically on the basis of network and web page 
15 parameters, and 

- a second factor being a function, preferably of 
the hyperbolic type, of an optimisation parameter. 

Such a model is typically represented by a 
relationship of the type: 
(nd\ (nh ^, 

where t is the total download time of the page, n 
is the number of objects therein, d is the average size 
of these objects, b is the throughpiSb of the downstream 
link (downlink) , h is the dimension of the HTTP 

25 headers, 1 is the network RTT. 

For a given set of values for n, d, b, h, and 1 
the relationship in question does in fact represent a 
class or set of models, the various models in the set 
being characterized by a respective value of the 

30 parameter X. 

Calibrating the free parameter (s) in the 
evaluation model on the basis of the sample web pages 
essentially requires identifying a value for the 
parameter X that corresponds to an '"optimum" model, 

35 i.e. a model best matching the input -to-output 
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relationships that are actually measured in respect of 
the sample web pages. 

Those of skill in the art will promptly appreciate 

that : 

5 - the models out of which the -optimum" model is 

selected (based on the measurements carried out on the 
sample web pages) may in fact correspond to a plurality 
of different relationships, including heuristic models, 
and 

-LQ . the ""free" parameters involved in the 

optimisation process may be any number, and not just 

one (i.e. X) as in the exemplified case. 

Obviously, increasing the number of parameters 

involved in the optimisation process, will lead to a 
15 more complex, resource- and time-consuming optimisation 

process . 

The experiments carried out by the Applicants have 
however shown that, at least insofar as existing GPRS 
networks are concerned, the simple relationship (I) 
20 reported in the foregoing and involving only one ^^free" 
parameter (namely X) leads to quite satisfactory 
results. 

In general, different types of models are used for 
evaluating download times and efficiency indexes for 
25 different types of network 

The model to be actually used for a specific case 
will be selected depending on the type of network 
considered. 

In fact, each model includes approximations that 
30 apply only for certain network types. Consequently, it 
is necessary to measure certain network parameters 
(essentially the available bandwidth and the RTT) and 
then select on the basis of pre -determined thresholds, 
the model best suited for determining the download 
35 times of HTTP pages on such a network. 
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The measurement tool for the download time 
provided in the processing unit 12 measures the time 
needed for downloading a given web page by reproducing 
the behaviour of certain predefined type of web 
5 browser, for instance Internet Explorer ®. 

As its input, the tool in question accepts a list 
of web pages to be downloaded. As its output, for each 
web page, the following data are provided: 

- total download time, 

10 - dimension of each object downloaded, 

- start and end times of downloading each object 
downloaded. 

The results of measurements are stored in the 
database 16. 

15 In the presently preferred embodiment, a specific 

tool (currently available with the applicant as BMPOP) 
is used for downloading pages and deriving the 
respective download times in co-operation with a 
^'sniffer" for obtaining the dimensions and the download 

20 start and end times for each object. 

. In the diagrams of figure 1 and 2, reference 20 
designates the database comprising data base items that 
define the model to be optimised for predicting the 
download time of a given web page. 

25 As its input, the database 20 accepts data such as 

network throughput and RTT, the dimensions of each 
object included in the web page and one or more free 
parameters defined experimentally. As its output, the 
database 20 provides the download time for a given page 

30 and/or its efficiency f.actor, as defined previously. 

For instance, this may occur (with reference to 
existing GPRS networks) on the basis of the 
relationship (I) considered in the foregoing, where: 

- t is the total download time of the page (i.e. 

35 the output of the model) , and 
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- n is the number of objects in the page, d is the 
average size of its objects, b is the throughput 
available in the downstream link, h is the dimension of 
the HTTP headers, 1 is the network RTT {i-e. the set of 

5 input data to the model) , and 

- A. is a factor (parameter) to be established 
experimentally to identify the ^^optimum" model to be 
used for evaluation purposes. 

Reference 22 denotes a further database (that in 

10 fact may be incorporated with the database 20) 
including the optimum parameters for the models 
computed for each combination of network parameters and 
browser type. 

In fact, the arrangement shown herein lends itself 

15 to be operated in such a way that the optimum 
parameter (s) - e.g. X - are deteinnined for a given 
model type and for a given network type by measuring 
the download times of the set of sample pages and then 
obtaining the best value for the parameter (s) , that are 

20 stored in the database 22. Then the optimum 
parameter (s) - e.g. X - are determined for one or more 
network types in respect of given models (identical or 
different from the one considered) , by measuring the 
download times of the set of sample pages and then 

25 obtaining the best value for the parameter (s) in the 
database . 22 . The database 22 is thus populated with 
different optimum values to be used for evaluating 
tasks related to different types of networks. 

Optimisation of each model for a given type of 

30 network (e.g. finding the value of X that identifies 
the optimum model in the set expressed by the 
mathematical relationship (I) referred to in the 
foregoing) is performed by an optimiser module 24. 
Input data to the module 24 are preferably: 
35 - the type of model to be used (e.g. the 

relationship (I) repeatedly cited in the foregoing) , 
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- throughput and. RTT of the network considered 
(e.g.. '^b*' and ^l'' in the relationship (I)), 

* list of the web pages, 

- for each page in the list: the start and end 
5 downloading times (whose . difference is the parameter 

^^t" in the relationship (I)) and the dimensions of .all 

the objects comprising such a page {e,g, ^"n" , "^h" and 

*^d" in the relationship (I)). 

The output of the module 24 is comprised of the 
10 optimum value (s) for the free parameter (s) of the model 

being used ((e.g. ^X* in the relationship (I)). 

Specifically, the module 24' operates by allotting 

a given value to the or each free parameter in the 

model (e.g. ^^X") and then computing the download times 
15 of the pages in the sample set on the basis of such 

value . 

A comparison is then made with the corresponding 
download times as measured experimentally and a 
^^global" error is then computed as a function of the 
20 ^^partial" errors for each page. 

This may be done by resorting to statistical 
criteria such as e.g. the root means square (RMS) error 
or the peak value of the signal-to-noise ratio (PSNR) . 

The value of the free parameter (s) is then varied 
25 searching for the minimum of the global error. 

This result is preferably achieved in a numerical 
manner, e.g. by means of a standard numerical method 
(e.g. steepest descent) aiming at minimising the global 
error or reducing it below a predetermined value. 
30 Preferably, the server (s) 14 as well as the 

databases 16, 18, 20, 22 and the module 24 are jointly 
■ configured in the form of a local area network (IAN) . 

The databases 20 and 22 are intended to co-operate 
with additional databases and other modules in 
35 evaluating the download performance for a given set of 
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web pages on the basis of the optimum model identified 
in the foregoing. 

In figiire 2, reference 26 denotes still another 
.database including the statistical characteristics of a 
5 list of web pages for which download performance is to 
be evaluated. 

Specifically. for each web page the following 

items are stored: 

- URL, 

10 - list of the objects comprised in the page, 

- dimension of each object. 

This database is populated by means of a web site 
analyser 28 and is subsequently used for determining 
the download times of the pages contained therein. 
15 The web site analyser 28 is another module adapted 

to derive the characteristics of the web page to be 
used as the input for a download performance predictor 
30. 

The input to the web site analyser 28 is comprised 
20 of a list of web pages to be analysed. 

The output from the analyser 28 is comprised, for 
each page in the input list, of the following items: 

- the list of the object comprising the page, and 

- the dimensions of each object. 

25 Such an output is stored in the database 26- 

Typically, the web site analyser 28 is operated on 
a fast network, thus making it possible to collect 
information concerning a large number of web pages in a 
short time. 

3 0 The predictor 30 is comprised of a module adapted 

for calculating the download time and the efficiency 
index for a given web page without actually performing 
any measurement. 

The predictor 30 is essentially a software module 

3 5 adapted to receive as its input data such as the 
network characteristics, the browser type used and the 
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characteristics of the web page while providing as its 
output the download time and the efficiency index 
evaluated for that page. 

In a preferred embodiment, the predictor 30 
5 accepts as its input the following items: 

- throughput and RTT of the network considered; 

- model and free parameter (s) of the model i.e. 
the '^optimum" model to be used for evaluating the 
download performance by way of prediction, and 

10 - number and dimensions of the object comprised in 

the page- 

The output of the predictor 3 0 is comprised" 
essentially of the predicted download time for the page 
and its efficiency index. 

15 Data pertaining to the characteristics of the page 

are read fromi the web page statistics database 26, the 
parameters to be used are read from the optimised 
parameter database 22 and the results are written into 
a prediction database 32, 

20 The efficiency index referred to in the foregoing 

is preferably determined by resorting to a two-step 
procedure . 

As a first step, the average throughput of each 
web page is computed by dividing t^ie total number- of 
25 bytes therein by the download time. 

Subsequently, the efficiency index is computed as 
the ratio of the web page throughput to the network 
throughput (as measured previously) • 

In the notional absence of protocol overhead, such 
30 an efficiency index would be equal to one. 

The database 32 includes the download times and 
the efficiency indexes evaluated for the web pages 
included in the list of the web pages to be analysed by 
means of the "optimum" model defined previously. 
35 The database 32 is populated by the predictor 

module 30 and it includes the download time sind the 
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efficiency index as evaluated for each web page (and 
for each network type) , on the basis of the optimised 
model. Preferably, for each web page the following 
items are recorded in the database 32 : 
5 • download time, 

- estimated efficiency index, 

- network parameters (throughput and RTT) , 

- model , 

- free parameter (s) of the model as used for the 

10 prediction. 

Consequently, the database 32 will contain (for 
each of the web pages) the expected download time and 
the efficiency index on a given type of network. 

Again, the various blocks shown in figure 2 are 
15 preferably configured in order to co-operate in the 
form of a IAN, In actual fact, the arrangements shown 
in figures 1 and 2 can be regarded as corresponding to 
the same LAN being subsequently re-configured to 
perform two basic processing phases. 
20 Th^se two phases essentially correspond to the 

sequence of steps in the flow- charts of figure 3 and 
figure 4, respectively. 

In the first phase, measurements are carried out 
on the network N in order to determine the 
25 characteristics thereof, while the download times for 
the sample web pages are measured. The results of such 
measurements are used for selecting a preferred 
(^^optimum") model and for setting the free parameters 
(e.g. X) of such a model. In the second phase, the 
30 model performs prediction by using those parameters. 

In the flow-chart of figure 3, starting from a 
step 100, in a step 102 the sample web pages are 
selected and the respective sample files to be used for 
carrying out the network measurements are loaded into 
35 the reference server (s) 14. These files may be 
comprised, for instance, of files of different 
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dimensions to be downloaded via HTTP. The information 
pertaining to the sample web pages thus selected is 
stored in the database 18. 

In a step 104, throughput and RTT are measured for* 
5 the network by accessing the reference server (s) 14. 
These measurements are performed by using the tools 
available in the computer 12 and the respective results 
are written in the measurement database 16. 

It will be appreciated that, while represented in 

10 a sequential fashion, the steps 102* and 104 (as well as 
most of the ensuing steps deriving therefrom) can be 
carried out at least partly simultaneously. 

As indicated, the set of sample web pages stored 
in the database 18 is selected as a statistically 

15 meaningful set comprising, for instance, the home pages 
of the most frequently accessed websites in a certain 
area. Essentially, this choice is carried out in such a 
way that the selected sample web pages will not be 
appreciably different from the web pages to which the 

20 expected predictions will apply. The statistical 

analysis of the san^le web pages is carried out in a 

step designated 106 and the list of the sample web 

pages is given as input to the measurement tool for the 

ft-. 

download times of the web pages. 

25 This tool performs in a step 108 the respective 

measurements of the download times for the sample web 
pages. The page statistics (for instance dimensions and 
number of objects included therein) and the results are 
stored in the measurement database 16. 

3 0 In step 110 a candidate model (such as, for 

instance, the relationship I considered in the 
foregoing) is selected from the model database 20. 

This is preferably done depending on a certain 
predetermined threshold concerning the network 

35 parameters. For instance, if the measurements have been 
performed on a low throughput network, a model is 
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selected where the server processing times are 
neglected. Conversely, if a high throughput network is 
being considered, a model taking into account also 
those processing times will have to be chosen. 
5 In step 112 the optimiser module 24 is activated 

in order to process the data comprised of the 
measurement results and the type of model chosen. The 
purpose of optimisation is to derive one or more 
optimum parameters (i.e. the optimum model) for the set 
10 of sample pages chosen to be memorized in the parameter 
database 22. The optimum parameter (s) thus obtained can 
be s\ibsequently used for predicting, via the module 30, 
the download times for each page on the same type of 
network . 

15 For instance, assuming the model is represented by 

the relationship (I) referred to in the foregoing, the 
purpose of optimisation is to identify an optimum value 
of the parameter X that minimises or reduces below a 
predetermined value the difference (error) between the 
20 download times for the sample pages as actually 
measured and as evaluated by way of prediction using 
the model, respectively. 

Such an optimisation process may be repeated for 
different types of networks for which download 
25 performance is intended to be evaluated, so that 
optimum models can be obtained and stored for different 
types of network to be subsequently analysed. 

The download times predicted are compared to the 
corresponding download times as actually measured for 
30 those pages to define a global error associated with 
the model /parameters under test. 

As indicated, the global error is defined as an 
entity (e.g. MSE, PSNR) indicative of the difference 
between the predicted values and the values measured 
3 5 over the whole set of the sample web pages. 
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The optimisation process is thus of iterative 
nature . 

The step 114 is representative of the calculation 
of the global error on the basis of the optimised 
5 parameters. In fact, the step 114 may be regarded as 
representative of the global error calculated in the 
final iteration of the optimisation process of the step 
112. 

In a comparison step 116 the global error in 
10 question is compared with a fixed threshold adapted to 
be defined empirically. If the comparison test is not 
passed (i.e. the global error is higher than the 
threshold), a substantial likelihood exists that the 
model used is not by itself a correct one: for instance 
15 a model has been chosen that does not take into account 
the processing times at the web server, while a ^^fasf 
(i.e. a high throughput) network is considered, whereby 
the processing times in question must be accounted for 
in the model in order to obtain satisfactory results. 
20 In that case (negative outcome of the step 116) 

the system evolves via a return step 117 to the 
selection step 110, which leads to a new type of model 
being selected. 

The whole "optimisation process is thus repeated 
25 with the aim of obtaining a lower global error. 

A positive outcome of the test of step 116 
indicates that the global error obtained on the basis 
of the optimised model (e.g. in the case of the 
relationship I referred to in the foregoing, an 
30 optimised value for X giving the minimum global error) 
is acceptable. 

The system thus evolves to a step 118, which 
represents the beginning of the second phase 
represented by the flow-chart of figure 4. 
35 In such a second phase, the download performance 

(e.g. the download times and the efficiency indexes) is 
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evaluated for a selected set of pages (use pages) . This 
is done in a non- intrusive manner, by way of 
prediction, using the optimised model defined in the 
previous phase 

5 In a step 120 the data base items comprising the 

list of the selected pages or use pages is read from 
the database 26 and in a step 122, the site analyser 28 
is activated. The site analyser 28 is currently 
operated on a fast network in order to obtain in a 
10 short time statistics data related to a large number of 
pages . 

For each page in the set of the selected web 
pages, the analyser 28 determines the list and the 
dimensions of the respective objects to be determined, 
15 These items are memorised in the statistics database 
26. 

In a STibsequent step 124 the predictor 30 is 
activated so that the total download time and the 
efficiency index are determined for each page in the 

2 0 list. The data concerning the pages are read from the 

statistics database 26, while the model /parameters to 
be used by the predictor are read from the respective 
databases 20 and 22. 

The results are stored in the prediction database 
25 32, and the system then evolves to a final step 126, 

The final result is an evaluation of the download 
times (and the efficiency indexes) for a selected 
number of pages among those accessible, and thus 
downloadable, via the network N. 
30 It will be appreciated that the download times 

(and the efficiency indexes) evaluation can be a useful 
tool both for 

- service providers in order to permit them to 
realise web pages having download times corresponding 

3 5 to the user requests; and 
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- network operators in order to permit them to 
know download times in a non- intrusive way. 

While the nviniber is these pages may be very high", 
the download performance data can be evaluated by way 
5 of prediction in a short lapse of time. This takes 
place without interfering in any way with operation of 
the network N by using an optimised model defined on 
the basis of a set of sample pages including a 
relatively small number of sample pages (e.g. the home 
10 pages of the 100 most visited sites) that are 
statistically homogeneous with the pages whose download 
performance is to be evaluated. 

Of course, without prejudice to the underlying 
principle of the invention, the details and embodiment 
15 may vary, also significantly, with respect to what has 
been described by way of example only, without 
departing from the scope of the invention as defined by 
the annexed claims. 
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CLAIMS 

1. A method for evaluating download performance 
of web pages accessible via a network (N) , 
characterized in that it includes the steps of: 
5 - providing at least one model (20) for predicting 

a set of download performance parameters for said web 
pages, said at least one model including at least one 
optimisation parameter (X) , 

- defining (102) a set (18) of sample web pages, 
10 - measuring (104, 108) said set of download 

performance parameters for said sample web pages (18) , 

evaluating (110) said set of download 
performance parameters for said sample web pages (18) 
on the basis of said model (20) for different values of 
15 said at least one optimisation parameter (X) , 

- defining an error indicative of the difference 
between said set of download performance parameters for 
said sample web pages (18) as measured (104, 108) and 
as evaluated on the basis of said model (20) , 

20 respectively, 

selecting (110 to 117) an optimised model 
including a value of said at least one optimisation 
parameter (X) in order to reduce said error below a 
predetermined value, 
25 - selecting (120) a set of use web pages, and 

evaluating (124) said set of download 
performance parameters for said selected set of use web 
pages on the basis of said optimised model, 

2. The method of claim 1, characterized in that 
3 0 said set of download performance parameters includes at 
least one parameter selected from a group comprising: 

- download time for a given web page, and 

- an efficiency index indicative of how said given 
web page exploits the capacity of said network (N) . 
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3. The method of claim 1, characterized in that 
said at least one model includes at least one parameter 
selected from a group comprising: 

- the throughput of said network (N) , 

5 - the roiind trip time (RTT) of said network (N) , 

and 

- at least one of the type and size of each object 
included in said web pages. 

4. The method of claim 1, characterized in that 
10 said sample web pages are selected as a statistically 

meaningful set of the web pages available for 
downloading via said network (N) . 

5. The method of claim 1, characterized in that 
said at least one model is selected by taking into 

15 account at least one threshold related to operational 
parameters of said network (N) . 

6. The method of claim 5, characterized in that 
it includes the steps of providing in said network (N) 
at least one server having a respective processing time 

20 and in that said at least one threshold is a function 
of said processing time- 

7. The method of claim 1, characterized in that 
it includes the steps of: 

- defining, for each sartqple page in said set of 
25 sample pages, a partial error indicative of the 

difference between said set of download performance 
parameters for said sati^le web pages (18) as measured 
(104, 108) and as evaluated on the basis of said model 
(20) , respectively, 
30 - determining, from the partial errors defined for 

each sample page in said set of sample pages a global 
prediction error, 

- selecting (110 to 117) said optimised model 
including a value of said at least one optimisation 

35 parameter (X) minimising said global prediction error. 
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8. The method of claim 7, characterized in that 
it includes the steps of defining said global 
prediction error as one of a mean value (RMS) and a 
peak value (PSNR) of the partial errors defined for 

5 each sample page in said set of sample pages, 

9. The method of claim 1, characterised in that it 
includes the step of providing different types of said 
at least one model (20) for different types of said 
network (N) - 

10 10. A method of evaluating download times of web 

pages accessible via a network (N) , characterized in 
that it includes the steps of evaluating said download 
times on the basis of at least one model comprising a 
module for evaluating the sum of: 
15 - at least one first factor determined 

analytically on the basis of network (b, 1) and 
web page (n, d, h) parameters, and 
- a second factor being a function of an 
optimisation parameter (X) . 
20 11, The method of claim 10 characterised in that 

said second factor is a function of hyperbolic 
type. 

12. The method of claim 3 or claim 10, 
characterized in that said at least one model (20) 

25 corresponds to the following relationship: 

where t is the total download time of the page, n 
is the number of objects therein, d is the average size 
for its objects, b is the downstream throughput, h is 
30 the dimension of the HTTP headers, 1 is the network 
round trip time (RTT) and X is said at least one 
optimisation parameter, 

13. A system for evaluating download performance 
of web pages accessible via a network (N) , 

35 characterized in that it includes: 
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- first data base items (20) defining at least one 
n»odel for predicting a set of dovmload performance 
parameters for said web pages, said at least one model 
including at least one optimisation parameter (X) , 

5 - second data base items defining a set (18) of 

sample web pages, 

- measiaring tools (12, 14) for measuring (104, 
108) said set of download performance parameters for 
said sample web pages (18) / 

10 - a predictor (30) for evaluating (110) said set 

of download performance parameters for said sample web 
pages (18) on the basis of said model (20) for 
different values of said at least one optimisation 
parameter (X) , 

15 - an optiraiser module (24) for defining an error 

indicative of the difference between said set of 
download performance parameters for said sairple web 
pages (18) as measured (104, 108) and as evaluated on 
the basis of said model (20) , respectively, said 

20 optimiser module (24) being configured for selecting 
(110 to 117) an optimised model including a value of 
said at least one optimisation parameter (X) able to 
reduce said error below a predetermined value, 

- third data base items (26T indicative of a 
25 selected set of use web pages, 

- said predictor (30) being configured for 
evaluating (124) said set of download performance 
parameters for said selected set of use web pages on 
the basis of said optimised model. 

30 14. The system of claim 13, characterized in that 

said set of download performance parameters includes at 
least one parameter selected from a group comprising: 

- download time for a given web page, and 

- an efficiency index indicative of how said given 
35 web page exploits the capacity of said network (N) . 
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15. The syatetn of claim 13, characterized in that 
said at least one model includes at least one parameter 
selected from a group comprising: 

- the throughput of said network (N) , 

5 - the round trip time (RTT) of said network (N) , 

and 

- at least one of the type and size of each object 
included in said web pages. 

16. The system of claim 13, characterized in that 
10 said second data base items (18) is configured for 

defining sample web pages comprising a statistically 
meaningful set of the web pages available for 
downloading via said network (N) . 

17. The system of claim 13, characterized in that 
15 said optimiser module (24) is configured for: 

- defining, for each sample page in said set of 
sample pages, a partial error indicative of the 
difference between said set of download performance 
parameters for said sample web pages (18) as measured 

20 (104, 108) and as evaluated on the basis of said model 
(20) , respectively, 

- determining, from the partial errors defined for 
each sample page in said set of sample pages, a global 
prediction error, 

25 - selecting (110 to 117) said optimised model 

including a value of said at least one optimisation 
parameter (\) minimising said global prediction error. 

18, The system of claim 17, characterized in that 
said optimiser module (24) is configured for defining 

30 said global prediction error as one of a mean value 
(RMS) and a peak value (PSNR) of the partial errors 
defined for each sample page in said set of sample 
pages . 

19. The system of claim 13, characterised in that 
3 5 said first data base items (20) are representative of 
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different types of said at least one model (20) for 
different types of said network (N) - 

20. A system for evaluating download times of web 
pages accessible via a network (N) , characterized in 

5 that it includes data base items (20) defining at least 
one model for evaluating said download times, said 
model comprising a module for evaluating the sum of: 

- at least one first factor determined 
analytically on the basis of network (b, 1) and 

10 web page (n, d, h) parameters, and 

- a second factor being a function of an 
optimisation parameter (X) . 

21. The system of claim 20 characterized in that 
said second factor is a function of hyperbolic 

15 type. 

22. The system of claim 15 or claim 20, 
characterized in that said at least one model (20) 
corresponds to the following relationship: 

2 0 where t is the total download time of the page, n 

is the number of objects therein, d is the average size 
for its objects, b is the downstream throughput, h is 
the dimension of the HTTP headers,^ 1 is the network 
round trip time (RTT) cuid X is said at least one 

25 optimisation parameter. 

23 . A computer program product directly loadable 
into the memory of a computer and including software 
code portions for performing the steps of any of claims 
1 to 12 when the product . is rtan on a computer . 

30 
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